Collaborative Track Analysis, Data Cleansing, and Labeling
نویسندگان
چکیده
Tracking output is a very attractive source of labeled data sets that, in turn, could be used to train other systems for tracking, detection, recognition and categorization. In this context, long tracking sequences are of particular importance because they provide richer information, multiple views, wider range of appearances. This paper addresses two obstacles to the use of tracking data for training: noise in the tracking data and the unreliability and slow pace of hand labeling. The paper introduces a criterion for detecting inconsistencies (noise) in large data collections and a method for selecting typical representatives of consistent collections. Those are used to build a pipeline which cleanses the tracking data and employs instantaneous (shotgun) labeling of vast numbers of images. The shotgun labeled data is shown to compare favorably with hand labeled data when used in classification tasks. The framework is collaborative – it involves a human-in-the loop but it is designed to minimize the burden on the human.
منابع مشابه
Cleansing and preparation of data for statistical analysis: A step necessary in oral health sciences research
In many published articles, there is still no mention of quality control processes, which might be an indication of the insufficient importance the researchers attach to undertaking or reporting such processes. However, quality control of data is one of the most important steps in research projects. Lack of sufficient attention to quality control of data might have a detrimental effect on the r...
متن کاملTrack Xplorer: A System for Visual Analysis of Sensor-based Motor Activity Predictions
With the rapid commoditization of wearable sensors, detecting human movements from sensor datasets has become increasingly common over a wide range of applications. To detect activities, data scientists iteratively experiment with different classifiers before deciding which model to deploy. Effective reasoning about and comparison of alternative classifiers are crucial in successful model devel...
متن کاملMulti-view News Video Topic Tracking Approach
Existing researches on tracking topics of news videos require lots of labeled examples. However, video labeling is too time-consuming to generate a large number of labeled videos in real applications. In this paper, a novel approach is proposed to track news video topics through using only a few labeled samples. The three main characters of proposed approach are: (1) Multi-view learning process...
متن کاملTrack detection on the cells exposed to high LET heavy-ions by CR-39 plastic and terminal deoxynucleotidyl transferase (TdT)
Background: The fatal effect of ionizing radiation on cells depends on Linear Energy Transfer (LET) level. The distribution of ionizing radiation is sparse and homogeneous for low LET radiations such as X or γ, but it is dense and concentrated for high LET radiation such as heavy-ions radiation. Material and Methods: Chinese hamster ovary cells (CHO-K1) were exposed to 4 Gy Fe-ion 2000 keV/...
متن کاملInvestigation of the Talent Identification Factors and Talent Education Development on Track & Field's Athletes in Iran
Objective: The purpose of current investigation is studding of talent identification factors and education talent development on the track & field's athletes in Iran by views of experts, coaches and athletes.Methodology: The descriptive research method was used in current research. The number of 150 experts, coaches and athletes were recruited in this research. The number of 6 variables and 45...
متن کامل